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Abstract. I perform a joint counts- in-cells analysis of galaxies of dif- 
ferent spectral types using the Las Campanas Redshift Survey (LCRS). 
Using a maximum-likelihood technique to fit for the relationship between 
the density fields of early- and late-type galaxies, I find a relative linear 
bias of 6 = 0.76 it 0.02. This technique can probe the nonlinearity and 
stochasticity of the relationship as well. However, the degree to which 
nonlinear and stochastic fits improve upon the linear fit turns out to de- 
pend on the redshift range in question. In particular, there seems to be 
a systematic difference between the high- and low-redshift halves of the 
data (respectively, further than and closer than cz ~ 36, 000 km/s); all of 
the signal of stochasticity and nonlinearity comes from the low-redshift 
portion. Analysis of mock catalogs shows that the peculiar geometry and 
variable flux limits of the LCRS do not cause this effect. I speculate 
that the central surface brightness selection criteria of the LCRS may be 
responsible. 



1. Introduction 



Different types of galaxies have different large-scale density fields ( Hubble 193(: ; 
Pemler 1974 ). Various authors have compared elliptical and spiral galaxies, 
finding that the fluctuation amplitude of ellipticals is stronger than that of 
spirals by a factor of 1.3-1.5 (Santiago & Strauss "19921 ; ILoveday et al. 1996| ; 
Hermit et al. 199(: ; [Guzzo et al. 1997 ). S imilarly, a comparison of the galaxy 



distribution in the IRAS redshift survey ( Strauss et al. 1992b) with those in 
the Center for Astrophysics Redshi ft Survey (CfA; i Huchra et al. 1983 ) and in 
the Optical Redshift Survey (ORS; Santiago et al. 1995| ), shows that optically- 
selected galaxies are clustered more strongly than infrared-selected galaxies by a 
similar factor ( pavis et al. 198"8| ; [Babul fc Postman 1990| ; ^trauss et al. 1992a ). 

All of these statistics, however, only compare the amplitudes of the fluctua- 
tions of each galaxy type. It may be that the pattern of fluctuations is different 
for different galaxy types, as well. That is, that the local density of early-type 
galaxies is not deterministically related to the local density of late-type galaxies; 
other variables may affect the relationship. One way to quantify this is by using 
the correlation coefficient r^i = {5eSi) / OeOu where 5 refers to the galaxy over- 
density and o"^ = ((5^) is its variance, r^i = 1 if the fields are perfectly correlated, 
and = if they are completely uncorrelated. In large-volume hydrodynamic 
simulations of galaxy formation, Blanton et al. (1999) have found a low corre- 
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lation coefficient between tlie density fields of old and young galaxies, around 
~ 0.5. The differences arise in large part from the fact that the dependence 
of each galaxy density field on temperature is different; young galaxies are not 
found in high-temperature regions in the simulations. 

Motivated by these theoretical results, in this proceeding I perform a point- 
by-point comparison of the density fields of different galaxy types, u sing a counts- 
in-cells analysis of the Las Campanas Redshift Survey (LCRS; [Shectman ei 
al. 1996| ). The maximum-likelihood method I use is superior to the comparision 
of second moments alone ( pTegmark &: Bromley 199^ ), as I will demonstrate be- 
low, and can probe the nonlinearity and stochasticity of the relationship between 
different galaxy types. I describe my method in Section 2, describe the nature 
of the data in Section 3, describe my results in Section 4, and show results from 
mock catalogs in Section 5. The results presented at the meeting have been 
amended; in particular, the stochasticity detected between the density fields 
turns out to depend critically on redshift, as explained more completely below. 
I conclude in Section 6. 



2. Method 

I am interested in f{5e,Si), the joint distribution of the overdensities 5e of early- 



type galaxies and 6i of late- type galaxies. Dekel & Lahav (1999) show how such a 
joint distribution determines the relationship between the correlation functions, 
the power spectra, and higher-order moments of two density fields. It is possible 
to constrain the properties of f{6e,6i) using these quantities. However, a more 
direct approach is to analyze the related joint probability distribution P{Ne, Ni) 
of finding early-type and A'^; late-type galaxies in a single cell of size R. After 
all, this latter probability is simply f{5e, Si) (where the fields are smoothed over 
the volume of a cell) convolved with Poisson distributions. If one notes that 
f{6e,Si) = f{6i\6e)fi5e), One can write 

x/d^z ^^'''^'^+^'^''' e-^'(^+^')/(^d^.), (1) 

where iVg and Ni are the average number of galaxies of each type expected in 
a cell of a given volume (and given selection criteria), and a represents the 
parameters of f{6e,di). One can then fit for a by minimizing the quantity 
C = —2^^lnP(Ne^i, Ni^i\a). In practice, I first fit for the parameters of the 
density distribution f{de)', then I fix the parameters of f{Se) and fit separately 
for those of f{6i\5e)- I calculate error bars using the well-known bootstrap 
method. 

I parameterize f{di,6e) by first specifying the density distribution function 
f{Se) and then the bias relation f(6i\6e)- Here, I will use a log-normal model for 
the density distribution function: 

f{6e) = ^ \^ exp [-x2/2ct21 . (2) 
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where Xf. = ln(l + 6^) + crg/2. This model is a better fit to the data than either 
a Gaussian or a first-order Edgeworth expansion ([Bernardeau fc Kofman 1994 ; 



[Juszkiewicz et al. 1995 ; Kim fc Strauss 1998| ); however, the results presented 



here do not depend sensitively on my choice for f{6e 

The bias relation f{6i\5e) can be either deterministic or stochastic. These 
terms are not meant to refer to underlying physical principles but are simply 
meant to express whether knowing the density of ellipticals tells you with cer- 
tainty the density of spirals, modulo Poisson noise. In the case of deterministic 
bias, the joint density distribution can be expressed as 

f{6i\6e) = 5''i6i-b{6e)) (3) 

where f{5e) is the density distribution function of early-type galaxies. Under 
this assumption, one can describe the models by the function b{6e)- 
The simplest model for b{5e) is linear bias: 

b{6e) =bo + bi6e, (4) 

which can be trivially extended to quadratic bias: 

b{6e) = bo + bi5e + b25l (5) 

Another possibility is "broken" bias, which is piece-wise linear with one slope in 
over dense regions and another in under dense regions: 

, / e N ( bo + bi5e for 5e < 
biSe) = [b^ + 6,5^ for 5, > 

I require that {6i) = 0, because it is meant to represent the overdensity of late- 
type galaxies. In practice, this requirement sets 6o) which is therefore not treated 
as a free parameter in any of the above expressions. 

If variables other than the local density field are important in determining 
where galaxies form, it may be that the different formation processes of early- 
type and late-type galaxies cause scatter in their relationship. Thus, I examine 
models which incorporate scatter by rewriting Equation (^, replacing the Dirac 
delta-function with a Gaussian of finite width: 



fi^^e) = r— exp 



{^i - bi5e)f 
2al 



(7) 



where b{5e) and f{5e) are chosen as above. In the case that f{6e) is Gaussian, and 
cr and cTf, are small, such that the limit 5 > —1 is not important, f{6e, Si) reduces 
to a bivariate Gaussian distribution, and the standard correlation coefficient is 
related to ai, by 

r = - i<^b/air (8) 

I will use Equation (|^ to fit linear bias with Gaussian scatter to the relationship 
between galaxy types. 
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3. Galaxies in the LCRS 



The LCRS (Shectman et al. 1996) consists of ~ 25,000 galaxies with a median 
redshift of 2; ~ 0.1. Three long slices (1.5° x 80°) were surveyed in the North 
Galactic Cap, and three in the South Galactic Cap. Within each hemisphere, the 
slices had the same right-ascension limits but were separated by several degrees 
in declination. In addition to the flux limits, a cut was applied on the central 
magnitude (the magnitude within the central two pixels of the CCD images), to 
avoid putting fibers onto galaxies unlikely to yield redshifts. As in all redshift 
surveys, this cut can affect the relationship between the luminosity function and 
the selection function, since the selection is not purely based on flux. 

Bromley et al. (1998) have used a spectral classification scheme to divide 
the galaxies into six "clans." For my purposes, I will split the galaxies into 
just two groups: an early-type group consisting of clans 1 and 2 and a late- 
type group consisting of clans 3 through 6. I place absolute magnitude limits 
on the early-type group of —22.5 < M < —18.8 and on the late-type group of 
—22.0 < M < —18.5. This procedure yields about 10,000 galaxies in each group. 

The geometry of the LCRS complicates an attempt to perform a counts- 
in-cells analysis on it. I create 14 redshift shells, each with an equal volume; 
thus, the shells at higher redshift have a shorter radial extent. In the angular 
dimension, I divide the survey into cells which are 3 spectrograph fields on each 
side (each field is about 1.5° x 1.5°). In the right ascension direction, the fields 
are adjacent; in the declination direction, the fields from the three slices in each 
Galactic hemisphere are combined. This procedure produces 518 cells total, each 
with a volume equivalent to a 15 Mpc sphere, of about cubical dimensions 
at 2: ~ 0.1 (except for the gaps in the declination direction). 



4. Results from the LCRS 

I derive the luminosity and selection functions from the data and calculate the 
expected counts in each cell for each galaxy type. The distribution of A^/A'exp 
is shown in Figure || for each galaxy type. The top quarter of Table || shows 
the results of fitting for the log-normal distribution and each bias model. Note 
first that for the linear fit 61 = 0.76 it 0.02; that is, the late-type galaxies are 
underbiased with respect to the early-type galaxies, to a degree which agrees 
with conventional wisdom. Some nonlinearity is detected in the quadratic bias 
case, at rather high significance (6a"), in the sense that the slope of the bias 
steepens at large densities (i.e. 62 > 0). The broken bias model also shows this 
effect, at somewhat lower significance. Stochasticity is detected at about lOcr 
significance, with at = 0.21 it 0.02. This level of stochasticity corresponds to r ~ 
0.87; for comparison, the moments method of [Tegmark fc Bromley (1999) would 



estimate r for this counts-in-cells distribution to be r = 0.73 it 0.01. Apparently 
a considerable amount of the "stochasticity" measured by the moments method 
is due to an inadequate characterization of the distribution of the densities, 
most likely because the method does not account for the non-Gaussianity of the 
Poisson distribution at low A'^ or for the lower limit of 5 > — 1. 

The likelihoods relative to the linear fit are also shown in Table |l| for all 
the other models, and indicate that the stochastic linear bias is clearly the best 
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Figure 1. Joint overdensity distribution of early- type galaxy (x- 
axis) and late-type galaxy (y-axis) counts-in-cells for the LCRS. The 
low-redshift cells are shown as solid squares and the high-redshift cells 
are shown as crosses. Note that it appears as if the low-redshift cells are 
systematically less biased than the high redshift cells, which accounts 
for the increased stochasticity and nonlinearity when these cells are 
included. This low bias could occur because the selection function for 
the late-type galaxies is overestimated at low redshifts. 

model I have considered. I have also compared each of the two-parameter fits 
to the linear fit by using a likelihood ratio test, whose results I show in the last 
column as the probability Prandom of getting the likelihood difference between 
the linear model and the two-parameter model by chance. Note that since I 
only ran 200 realizations for each estimate, there is a lower limit on ^random of 
0.005. From these results, it is clear that I detect nonlinearity at a statistically 
significant level, and stochasticity at an extremely significant level. 

As part of an effort to test for systematic effects, I split the cells into a 
high-redshift half and a low redshift half. Results for each half are shown in the 
middle two quarters of Table I find that the low redshift half retains the large 
amount of nonlinearity and stochasticity, while the high redshift half has no 
detectable signal. I can be more precise and eliminate the two innermost rings 
of cells, and fit to the rest, as shown in the bottom quarter of Table |l|. This 
set of cells shows no nonlinearity, and a much reduced stochasticity, indicating 
that most of the signal for stochastic bias was coming from the two innermost 
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Table 1. Properties of PDF and bias fits to LCRS joint counts- in- 
cells of early and late type galaxies, for different redshift selections. 

cz Range (km/s) (Je cr; Bias Model h\ 62 or (T;, 



10, 000-46, 000 


0.54 ±0.02 


0.41 ±0.02 


Linear 


0.76 ± 0.02 




0.0 


N/A 








Quadratic 


0.73 ± 0.03 


0.18 ± 0.03 


-13.1 


< 0.005 








Broken 


0.64 ± 0.05 


0.89 ± 0.05 


-5.2 


0.020 








Stochastic 


0.63 ± 0.05 


0.21 ± 0.02 


-106.0 


< 0.005 


10, 000-36, 000 


0.50 ±0.03 


0.39 ±0.02 


Linear 


0.75 ± 0.03 




0.0 










Quadratic 


0.73 ± 0.04 


0.19 ± 0.05 


-8.2 


0.005 








Broken 


0.66 ± 0.05 


0.85 ± 0.06 


-1.8 


0.165 








Stochastic 


0.55 ± 0.06 


0.22 ± 0.02 


-119.7 


< 0.005 


36, 000-46, 000 


0.61 ±0.03 


0.46 ±0.03 


Linear 


0.79 ± 0.05 




0.0 










Quadratic 


0.81 ± 0.06 


-0.06 ± 0.05 


-0.6 


0.420 








Broken 


0.80 ±0.11 


0.78 ± 0.08 


-0.0 


0.955 








Stochastic 


0.78 ± 0.04 


0.07 ±0.05 


-0.2 


0.370 


24, 000-46, 000 


0.56 ±0.03 


0.43 ±0.03 


Linear 


0.81 ± 0.03 




0.0 










Quadratic 


0.81 ± 0.04 


0.00 ±0.03 


-0.0 


0.990 








Broken 


0.82 ± 0.07 


0.80 ±0.05 


-0.0 


0.920 








Stochastic 


0.77 ±0.04 


0.13 ± 0.03 


-8.9 


< 0.005 



rings of cells; tests show that this effect is not due to the better signal-to-noise in 
the inner rings. Again I can express the stochasticity in terms of the correlation 
coefficient r ~ 0.95; the moments method of [Tegmark fc Bromley (1999) obtains 
r = 0.93 ± 0.03 for this set of cehs. 

In order to understand this effect, reconsider Figure [l|, where I have marked 
the cells in the inner two rings with square boxes. It is clear that these cells 
have a smaller bias compared to the other cells. This probably indicates that the 
selection function is overestimated for late- type galaxies at low redshifts. I have 
performed the same analyses using selection functions based on the luminosity 
functions of Bromley et al. (1998), and find the same effect. 



5. Results from Mock Catalogs 

Because of the peculiar geometry and selection effects of the LCRS it is nec- 
essary to test these results against mock catalogs where I have simulated the 
observational effects inherent in the survey. I would also like to understand 
whether some of the systematic trends with redshift found in the last section 
can be explained by observational effects. I run particle-mesh simulations of a 
300 h-^ Mpc box using 256^ particles and 512'^ grid cells. I use the flat model 
with = 0.4 and Q\ = 0.6. To select the late-type galaxies, I simply pick 
dark matter particles at random. To select the early-type galaxies, I smooth 
the density field with a 3 Mpc Gaussian filter, and apply a threshhold of 
Sc,e = 0.25; every dark matter particle above the threshold has an equal proba- 
bility of becoming an early-type galaxy. 

To create realistic mock catalogs, I pick a random particle in the simulation 
to represent the observer. I then "observe" the galaxies in the simulation box, 
using the angular and photometric limits of the LCRS, as well as the number 
of fibers available in each field ( phectman et al. 1996| ). Furthermore, there is a 
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Table 2. Properties of PDF and bias fits to realistic mock catalogs 
and volume-limited mock catalogs. 

Catalog Type c; Bias Model h\ 62 or 0-5 C 



Volume-limited 0.85 ± 0.02 0.54 ± 0.01 



Flux-limited 0.66 ± 0.03 0.46 ± 0.02 



Linear 


0.72 ±0.01 




0.0 


Quadratic 


0.74 ±0.01 


-0.01 ±0.00 


-5.2 


Broken 


0.67 ±0.01 


0.79 ±0.02 


-14.3 


Stochastic 


0.72 ±0.01 


0.06 ±0.01 


-8.8 


Linear 


0.76 ±0.02 




0.0 


Quadratic 


0.77 ±0.02 


-0.05 ±0.01 


-2.8 


Broken 


0.70 ± 0.04 


0.82 ±0.05 


-2.0 


Stochastic 


0.74 ±0.03 


0.07 ±0.02 


-2.7 



probability of failing to observe the galaxy which is a function of its magnitude, 
determined from the data. In the real LCRS, fibers could not be placed more 
closely than 55"; I implement that restriction in the mock catalogs as well. 
Finally, I include the appropriate magnitude and redshift errors. 

As a benchmark, I take the simulation and divide it into cubic cells about 
25 hr^ Mpc on a side. I subsample the galaxies such that there are about 20-30 
galaxies of each type in each cell. I refer to this sample as the volume-limited 
catalog. It is free of all of the selection effects associated with the real survey, as 
well as redshift-space distortions. The cells are equivalent in volume to the cells 
described in Section 3. I will evaluate the degree to which the selection effects 
affect the results by comparing realistic mock catalogs to the results for these 
volume- limited cells. 

The results of this comparison are listed in Table ^. There is a significant 
difference between the variances measured for the volume-limited catalog and 
for the realistic mock catalog. I believe this is simply due to the fact that the 
non-cubical shapes of the cells in the LCRS allow the cells to probe effectively 
larger scales, thus driving down the variances. Tests show that the effect is 
not due to the incomplete sampling or due to fiber collisions. The important 
result, however, is that the bias is qualitatively the same (though with some 
quantitative differences). In particular, there is no increased for the mock 
catalogs; the ratio cTfe/o"; is about 50% higher for the mock catalogs than the 
volume-limited catalogs, but this level is still low enough for us to conclude 
that the stochasticity I observe in the real catalog is not caused by any of the 
observational effects modeled here. 



6. Conclusions 

I have presented a powerful method of analyzing the nature of the relationship 
between different galaxy types. As a probe of stochasticity it is superior to 
calculating second moments; in addition, it can measure nonlinearity. It would 
be both simpler to implement and less susceptible to the problems of selection 
I have run into here if it were used to analyze a volume-limited survey, which I 
intend to do using the Optical Redshift Survey. 
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I have interpreted the redshift-dependence of the results as a problem in the 
understanding of the selection effects of the survey. The outstanding selection 
effect which I have not modeled yet in my mock catalogs is the central magnitude 
cut. I am in the process of determining whether the effect on the selection 
function is strong enough to cause the redshift-dependence found in Table |l[ 
Note, importantly, that the results of Tegmark & Bromley (1999) are susceptible 



to the same problems I have found here, and should be interpreted accordingly. 

In conclusion, the result of this work I take most seriously is the bottom 
quarter of Table 0, which excludes the two innermost rings, and indicates a bias 
which is linear, with perhaps some mild scatter, and an amplitude of bi ~ 0.8. 
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